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Abstract. Davis and Mikosch [7] introduced the extremogram as a flexible quantitative tool for 
measuring various types of extremal dependence in a stationary time series. There we showed some 
standard statistical properties of the sample extremogram. A major difficulty was the construction 
of credible confidence bands for the extremogram. In this paper, we employ the stationary bootstrap 
to overcome this problem. Moreover, we introduce the cross extremogram as a measure of extremal 
serial dependence between two or more time series. We also study the extremogram for return 
times between extremal events. The use of the stationary bootstrap for the extremogram and the 
resulting interpretations are illustrated in several univariate and multivariate financial time series 
examples. 
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2 the extremogram 

1. Introduction 

With the wild swings recently seen in the financial markets and climatic conditions, there has 
been renewed interest in understanding and modeling extreme events. The extremogram, developed 
in Davis and Mikosch 0, is a flexible tool that provides a quantitative measure of dependence of 
extreme events in a stationary time series. In many respects, one can view the extremogram 
as the extreme-value analog of the autocorrelation function (ACF) of a stationary process. In 
classical time series modeling the ACF, and its sample counterpart, are the workhorses for measuring 
and estimating linear dependence in the family of linear time series processes. While the ACF 
has some use in measuring dependence in non-linear time series models, especially when applied 
to non-linear functions of the data such as absolute values and squares, it has limited value in 
assessing dependence between extreme events. On the other hand, the extremogram only considers 
observations, or groups of observations, which are large. 

For a d-dimensional strictly stationary time series (Xt), the extremogram is defined for two sets 
A and B bounded away from bjo 

(1.1) p A B (h) = lim Pix^Xh G B | x~ x Xq 6 4 h = 0, 1, 2, . . . , 

provided the limit exists. Since A and B are bounded away from zero, the events {x _1 A"o G A} and 
{x X^ G B} are becoming extreme in the sense the probabilities of these events are converging 
to zero with x — >• oo. In the special case of a univariate time series and the choice of the sets 
A = B = (l,oo), the extremogram reduces to the (upper) tail dependence coefficient between Xq 
and Xh that is often used in extreme value theory and quantitative risk managament; see e.g. 
McNeil et al. [16]. In this case, one is interested in computing the impact of a large value of 
the time series on a future value h time-lags ahead. With creative choices of A and B, one can 
investigate interesting sources of extremal dependence that may arise not only in the upper and 
lower tails, but also in other extreme regions of the sample space; see Sections H] and [5] for some 
examples. 

We would like to emphasize that the extremogram is a conditional measure of extremal serial 
dependence. Therefore it is particularly suited for financial applications where one is often interested 
in the persistence of a shock (an extremal event on the stock market say) at future instants of time. 
Another good reason for using the extremogram for financial time series is a statistical one: for large 
x, the quantities P{x^ 1 X} l G B \ x~ 1 Xq G ^4) are rare event probabilities; their non-parametric 
estimation cannot be based on standard empirical process techniques and requires large sample 
sizes. Fortunately, long financial time series are available and therefore the study of their extremal 
serial behavior is not only desirable but also possible. Financial time series often have the (from 
a statistical point of view) desirable property that they are heavy-tailed, i.e. extreme large and 
small values are rather pronounced and occur in clusters. The extremogram and its modifications 
discussed in this paper allow one to give clear quantititave descriptions of the size and persistence 
of such clusters. 

In estimating the extremogram, the limit on x in (|l.ip is replaced by a high quantile a m of the 
process. Defining a m as the (1 — l/m)-quantile of the stationary distribution of \Xt\, the sample 



A set C is bounded away from zero if C C {y : \y\ > r} for some r > 0. 
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extremogram based on the observations X\ , • • • , X n is given by 

En— h j 
,,.-,, ,, , *=1 1 {a m 1 X t+h eB,a^ 1 X t eA} 

i 1 - 2 ) PA,B{n) = ™ — ; • 

2-,i=l J {a' 1 A' t GA} 

In order to have a consistent result, we require m = m n — > oo with m/n — > as n — > oo. In 
practice, we do not know a m and therefore it has to be replaced by a corresponding empirical 
quantile, i.e., by one of the largest observations. Under suitable mixing conditions and other 
distributional assumptions that ensure the limit in (jl.ip exists, it was shown in Davis and Mikosch 
[7] that pA,B(h) is asymptotically normal; i.e., 

(1.3) y/n^(fiA,B(h)-PA,B m (h))AN(0,aX B (h)), 

where 

(1.4) PA,B-.m(h) = P{a^X h G B \ a^X E A) . 

We refer to (jl.4p as the pre-asymptotic extremogram (PA-extremogram) . 

There are several obstacles in directly applying (11 .3f) for constructing confidence bands for the 
extremogram: 

(i) The asymptotic variance cr AB (h) is based on an infinite sum of unknown quantities and 

typically does not have a closed-form expression, 
(ii) Estimating a\ B (h) is similar to estimating the asymptotic variance of a sample mean from 

a time series and is often difficult in practice, 
(iii) The PA-extremogram cannot always be replaced by its limit. 

For (i) and (ii), we turn to bootstrap procedures to approximate the distribution of (pA,B(h) — 
PA,B-.m(h)). This will allow us to construct credible (asymptotically correct) confidence bands for 
the PA-extremogram. As for (iii), a non-parametric bootstrap does not allow us to overcome the 
bias concern. We note, however, that the PA-extremogram is a conditional probability that is often 
the quantity of primary interest in applications. That is, one is typically interested in estimating 
conditional probabilities of extreme events as a measure of extremal dependence so that it is not 
necessary, and perhaps not even desirable to replace the PA-extremogram with the extremogram 
in (TOD . 

The objective of this paper is to apply the bootstrap to the sample extremogram in order to 
overcome these limitations. By now, there are many non-parametric bootstrap procedures in the 
literature that are designed for use with stationary time series. Many of these involve some form 
of resampling from blocks of observations. That is, in constructing a bootstrap replicate of the 
time series, long stretches of the time series are stitched together in order to replicate the joint 
distributions of the process. While for a finite sample size n, it is impossible to replicate all the 
joint distributions, we can only sample from at most the m-variate distributions (for m < n) by 
sampling blocks of m consecutive observations. In order to obtain consistency of the procedure, 
m is allowed to grow with n at a suitable rate. In this paper, we adopt the stationary bootstrap 
approach as described in Politis and Romano [18] in which the block sizes are given by independent 
geometric random variables. Since the blocks are of random length, the stationary bootstrap is 
useful as an exploratory device in which dependence beyond a fixed block length can be discovered. 

In our case, there are significant differences in the extremogram setting of our bootstrap appli- 
cation from the traditional one. First, the summands in the numerator and denominator of (jl.2p 
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form a triangular array of random variables and cannot be cast as a single stationary sequence. 
Second, most bootstrapping applications in extreme value theory, even in the iid case, require the 
replicate time series to be of smaller order than the original sample size n; see e.g. Section 6.4 
of Resnick [20]. On the other hand, we are able to overcome these drawbacks and show that the 
bootstrapped sample extremogram, based on the replicates of size n provides an asymptotically 
correct approximation to the left-hand side of (jl .31) provided the blocks grow at a proper rate. 

In addition to providing a non-parametric estimate of the nature of extremal dependence as a 
function of time-lag, the extremogram can also provide valuable guidance in various phases of the 
typical time series modeling paradigm. For example, the sample extremogram might provide insight 
into the choice of models for the data with the goal of delivering models that are compatible with 
the extremal dependence. In the standard approach, models are often selected to fit the center of 
the distribution and can be inadequate for describing the extremes in the data. On the other hand, 
if the primary interest is on modeling extremes, then the modeling exercise should focus on this 
aspect. The quality of fit could be judged by assessing compatibility of the sample extremogram 
with the fitted model extremogram. Moreover, the sample extremogram from the residuals of the 
model fit can be used to check compatibility with the lack of extremal dependence. 

Figure [TTT1 shows the sample extremograms of a GARCH(1,1) (left) and a stochastic volatility 
(right) process. The GARCH realization was generated from the model, 

(1.5) X t = a t Z t and of = 0.1 + 0.141^ + 0.84 of _! , 

where (Zt) is an iid sequence with common distribution given by a £4 (standardized to have variance 
1). The SV realization was produced from the model (Xt) satisfying 

(1.6) X t = a t Z t and log a t = 0.9 loga t _i + e t , 

where (et) is a sequence of iid standard normal variables and is independent of the iid sequence (Zt), 
which has a £2.6 distribution. These conditions ensure that both the GARCH and SV realizations 
are regularly varying with index a = 2.6, i.e. they have power law tails with index a; we refer to 
Section [2] for a precise description. For the calculation of the sample extremograms, samples of size 
n = 100,000 were used, the sets A = B = (1, 00) were chosen and, for a m , the .98 empirical quantile 
of the simulated data was taken. It is evident from the slower decay of the sample extremogram 
for the GARCH(1,1) process in Figure 11.11 that this process exhibits extremal clustering while 
the faster decay of the sample extremogram for the SV process indicates the lack of clustering. 
However, without a sense for the asymptotic distribution, it is virtually impossible to make any 
inferences about the extremogram (pre-asymptotic or otherwise) . Under the assumption of no serial 
dependence, one can compute permutation produced confidence bands (these are the solid lines in 
Figure II. ip . Clearly, the extremogram for the GARCH is significantly greater than the dashed 
line at height .02, which corresponds to the value of the pre-asymptotic value of the extremogram 
under the null hypothesis of independence. On the other hand, the extremogram for the SV 
process tails off after lag 18 and is not significantly different than the .02 value that one would 
expect for independent data. This is consistent with the theory described in Davis and Mikosch [7] 
in which there is extremal dependence for GARCH processes and none for SV processes. The use 
of permutation procedures is illustrated in more detail in Section 14.11 

The remainder of the paper is organized as follows. A brief interlude into the concept of regular 
variation on which the extremogram is built, is provided in Section [2j After establishing the theory 
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Figure 1.1. The sample extremogram for the upper tail for the GARCH(1, 1) (left) and SV processes 
(right), where the processes are specified in (|1.5[) and (ll.6[) . respectively. The sample size is n = 100,000 
and a m is the .98 empirical quantile. The solid horizontal lines are permutation-produced confidence bands 
and the dashed line at height .02 corresponds to the value of the PA- extremogram under independence. 



of the bootstrapped extremogram in Section [3l its use is demonstrated with several financial time 
series in Section^ In conjunction with the bootstrapped extremogram, we present a quick and clean 
method for testing significant serial extremal dependence using a random permutation procedure. 
This procedure is actually similar in spirit to using the block bootstrap procedure, but with block 
size equal to 1. The serial dependence is completely destroyed by randomly permuting the data so 
that the type I error for significance of the extremogram under the null of no serial dependence can 
be controlled. 

The cross-extremogram for multivariate time series is defined and illustrated for real data ex- 
amples including the returns of the major equity indices in Section [5j Like the univariate time 
series, the cross-extremogram will depend on two sets, often decided upon the practitioner. We 
use the cross-extremogram to provide a method for assessing extremal dependence between four 
major international stock markets, namely, the FTSE 100, S&P 500, DAX and Nikkei 225 Indices. 
Without controlling for the effect of changing volatility, the cross-extremogram gives significant 
dependence between the series for a large number of lags. However, after devolatizing each series 
using a GARCH model, the resulting extremogram shows significant dependence only at small lags. 
In addition, there is evidence of directionality: large values of one index follow another index. 

In their presentation [13], Geman and Chang |13j consider the waiting times between rare or 
extreme events for financial time series. They conclude from their analysis that there is evidence of 
significant extremal clustering. In Section 15.41 an extremogram that calculates these return times 
between extreme events is defined and the sample extremogram for various time series is provided 
as well. Consistent with the findings of Geman and Chang the presence of extremal clustering can 
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be detected easily using the bootstrap. The proof of the main theorems in Section [3] is provided in 
the Appendix. 

The reader who is mainly interested in applications of the sample extremogram to financial time 
series may skip the technical Sections [2] and [3J and directly go to Sections H] and [5j 

2. Brief Interlude into Regular Variation 

The extremogram (jl.ip is a limit of conditional probabilities and therefore not always defined. 
In this section we give a sufficient condition for its existence. The condition is rather technical; the 
interested reader is referred to Davis and Mikosch [7] for more details. We also mention that this 
condition is satisfied for some of the standard financial time series models such as GARCH and SV; 

see lam ei Eg using. 

In this paper we focus on strictly stationary sequences whose finite-dimensional distributions have 
power law tails in some generalized sense. In particular, we will assume that the finite-dimensional 
distributions of the (i-dimensional process (Xt) have regularly varying distributions with index a > 0. 
This means that for any h > 1, the radial part |Y/J of the lagged vector Y^ = vec(Xi, . . . , X^j is 
regularly varying with tail index —a : 

P{\Y h \>tx) 

„' r \ ^^t~ a asx^oo, t>0, 

P(\Yh\ > x) 

and the angular part Y^/I^l is asymptotically independent of the radial part \Yh\ for large values 
of \Yh\: for every h > 1, there exists a random vector 8(,£ §" rf_1 such that 

P(Y h /\Y h \ G ■ I \Y h \ > x) 4 P(Q h G •) as x -> oo. 



Here — > denotes weak convergence on the Borel a-field of S hd ~ l , the unit sphere in W hd with respect 
to a given norm | • |. The distribution P(®h E •) is called the spectral measure and a the index of 
the regularly varying vector Y^. We also refer to (Xt) as a regularly varying sequence with index a. 
For our purposes it will be convenient to use a sequential definition of a regularly varying sequence 
(Xt) which is equivalent to the definition above: there exists a sequence a n — >• oo, an a > and 
a sequence of non-null Radon measures (nh) on the Borel cr-field of M = K \{0} such that for 
h>l, 



(2.1) nP ( a -i Yh e-)^ 



MM 



where — > denotes vague convergence on the same cr-field; see Resnick [20] . Section 6.1. The limiting 
measures have the property Hh(t-) = t~ a / u/ l (-), t > 0. We refer to Basrak and Segers [3] who give 
an enlightening interpretation of the structure of a regularly varying sequence. 

Now to connect the measure in (|2.ip with the extremogram, for suitably chosen sets A and B 
in M. d bounded away from the origin, set for h > 2, A = A x R( fc_1 ) d and B = A x R( fc_2 ) d x B. 
Provided A and B are Hh continuity sets, then 

(2.2) PA , B (h - 1) = lim P(a^X h G B | a^X, € A) = lim "^.^ - S = ^ . 
n ^°° n ^°° nP(a n l Y h G A) /x h (A) 

It is worth noting that standard arguments in regular variation allow one to replace a n in the first 
limit appearing in (|2.2p with any sequence of numbers x n tending to oo. 
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3. The Bootstrapped Sample Extremogram 

In this section we will construct confidence bands for the sample extremogram based on a boot- 
strap procedure which takes into account the serial dependence structure of the data. The resulting 
confidence bands closely follow the sample extremogram. Moreover, assuming regular variation of 
the underlying time series, we will be able to show that the bootstrap confidence bands are asymp- 
totically correct. 



3.1. Stationary bootstrap. This resampling scheme was introduced by Politis and Romano [18J. 
It is an adaptation of the block bootstrap which allows for randomly varying block sizes. 

For any strictly stationary sequence (Yj) the stationary bootstrap procedure consists of generating 
pseudo-samples Y* , . . . , Y* from the sample Yl , . . . , Y n by taking the first n elements from 

(3-1) Y Kl ,... , Yxx+Li-u--- ,Yr n ,--- ,Yk n +l n -i , 

where (Ki) is an iid sequence of random variables uniformly distributed on {1, . . . , n}, (Li) is an iid 
sequence of geometrically distributed random variables with distribution P(L\ = k) = p (1 — p) 1 , 
k = 1, 2, . . . , for some p = p n G (0, 1) such that p n — > as n — > oo, and 

N = N n = mi{i > 1 : Li H \- U > n} . 

The upper limits of the random blocks {Ki, . . . , Ki+Li — 1} may exceed the sample size n. Therefore, 
in (|3.ip we replace the (unobserved) Yj's with t > n by the observations Y t mo d n- Finally, the three 
sequences (Yt), (Ki) and (Li) are also supposed independent. The dependence of the sequences 
(Ki) and (Li) on n is suppressed in the notation. The generated pseudo-sample can be extended 
to an infinite sequence (Y t *) by extending (13. lh to an infinite sequence. For every fixed n > 1, (Y t *) 
constitutes a strictly stationary sequence. 

3.2. Main results. We want to apply the stationary bootstrap procedure to the strictly stationary 
sequence of the indicator functions It = Is a -^x eC}> * e ^> wnere t ne underlying sequence (X t ) is 
strictly stationary W 1 - valued and regularly varying with index a, a m has the interpretation as a 
high quantile of the distribution of \Xq\, and C is a set bounded away from zero. The application of 
the stationary bootstrap in this context is rather unconventional since the sequences (It) constitute 
triangular arrays of strictly stationary sequences: through a m these sequences also depend on n. 

We write (7 f *) for a bootstrap sequence generated from the sample I\, . . . , I n by the stationary 
bootstrap procedure described above. In what follows, P*, E* and var* denote the probability 
measure generated by the bootstrap procedure, the corresponding expected value and variance. 
This means that P*(-) = P(- \ (Xt)) is the infinite product measure generated by the distributions 
of (Ki) and (Li). 

The bootstrap sample mean I n = n _1 Ya=i A* satisfies the following elementary properties: 

n 

E*(T n ) = E*(I{)=l n = n- 1 Y J h, 



sl = ^*(n 1 ' 2 f n ) = C n (0)+2Y,a-h/n)(l-p) h C n (h) 

h=l 
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where 

n 

C n (h) = n _1 ^(/j -l n )(I i+h -l n ), h = 0, . . . ,n, 

i=l 

are the circular sample autocovariances. Here we again made use of the circular construction 

I 3 = Ij mod n- Writing 

n—h 

7 n(/i) = n- 1 Y^iU ~ In)(k+h -In), h>0, 
i=l 

for the ordinary sample autocovariances, we have 

(3.2) C n (h)=jn(h)+jn(n-h), h = 0,...,n. 

Recall the notion of a strictly stationary regularly varying sequence from Section [5J in particular 
the sequence of limiting measures (/Jh); see (|2.ip . For convenience, we write /i = fj,\. For any subset 
CcR , define the quantities 

oo 

(3.3) a 2 (C) = n(C) + 2Y,T h (C), 

with Th{C) = Hh+i{C xl x C) and, suppressing the dependence on C in the notation, 

P m = ml n , p = EI 1 = P(a m 1 X 1 £C), 

Poh = E(I I h ) = P(a^ 1 X GC,a- 1 X h eC), h>\. 

We also need the following mixing condition: 

(M) The sequence (X n ) is strongly mixing with rate function (at)- Moreover, there exist m = 
m n — > oo and r n — > oo such that m n /n — > and r n /m n — > and 



(3.4) lim m n } a h = , 

n— >oo i! — » 

7i=r„ 

and for all e > 0, 

oo 

(3.5) limlimsupm n > P(|X^| > ea m , |X | > ea m ) = . 

fc^oo n ^oo T - f 

h=k 

Our next goal is to show that the stationary bootstrap is asymptotically correct for the boot- 
strapped estimator of P m given by 

n 

m 



S* T* V^ T* 

P m = mI n = —2_^I t 



n 
t=i 



The following result is the stationary bootstrap analog of Theorem 3.1 in [7]. It shows that the 
bootstrap estimator P^ of P m is asymptotically correct. 

Theorem 3.1. Assume that the following conditions hold for the strictly stationary regularly vary- 
ing sequence (X t ) of "R. d -valued random vectors: 
(1) The mixing condition (M) and in addition 

oo 
(3.6) 2, ka>k < co . 

h=l 
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(2) The growth conditions 
(3.7) P = Pr, 



0, and np /m 



oo. 



(3) The sets C and Cxi '" x C C M are continuity sets with respect to \i and Hh+i 

for h > 1, C is bounded away from zero and o~ 2 (C) > 0. 



1/9"" 

(4) The central limit theorem, (n/m) ' [P m — mpo) 
Then the following bootstrap consistency results hold: 

(3.8) E*{P* m ) 

(3.9) ms 2 n = var*((n/m) 1/2 P^) 
with o~ 2 (C) given in (|3.3p . In particular 

(3.10) P*(\P* m -n(C)\>5) " 
and the central limit theorem holds 



N(0,a 2 {C)) holds. 



M(C), 



a 2 (C) , 



0. 



5>0, 



(3.11) 



sup 



P*((n/ 



m 



,V2, 



ms: 



\)-V\P* m -P m )<x)-®{a 



0, 



where <£ denotes the standard normal distribution function. 



Politis and Romano |18| proved a corresponding result for the sample mean of the stationary 
bootstrap sequence (X£) for a finite variance strictly stationary sequence (X t ). They also needed 
the growth conditions p n — > and np n — > oo. Our additional condition (np n )(p n /m) — > oo, which 



'{a^XteC}' 



implies np„ — > oo is needed since P^ is an average in the triangular scheme If 
t = 1,. . . ,n. Although various steps in the proof are similar to those in Politis and Romano 
the triangular nature of the bootstrapped sequence (It) requires some new ideas. We found it 
surprising that the full stationary bootstrap works in this context. In the context of extreme value 
statistics the bootstrap often needs to be modified even when the data are iid. 

We now turn our attention to the sample extremograms for which both the numerator and 
denominator are estimators of the type P m . Therefore our next objective are the asymptotic 
properties of these ratio estimators which we study in a general context. We consider general sets 



D 1 ,...,D h C M and C 



D 



h+l 



c 



Hb 



h > 1. Since we deal with several sets D; we need to 



indicate that the indicator functions It depend on these sets: 

and we proceed similarly for the estimators P m (Di) of fi(Di). Now we define the corresponding 
ratio estimators 



P m {Di) J2t=i I {a^ 1 x t eD i } . 
Pc,Di = — = -^^ — . i = li 



h. 



P m (C) 22t=l {{a^XteC} 

Davis and Mikosch [7j, Corollary 3.3, proved the joint asymptotic normality of these estimators: 
Note that there is a misprint for the expression for ro i £> . in |7J , which we now correct here as 



h=l 



■j x ^ 



^'xAJ + mitAxli^x^]. 



10 THE EXTREMOGRAM 

The centering in the central limit theorem of Corollary 3.3 of [7j uses the PA-extremogram as 
opposed to the extremogram. In general the PA-extremograms cannot be replaced by their limits 

MA) . 1 , 

unless the following additional condition holds 

(3.12) lim ^h^[fi{D i )P{a^ 1 X o eC)-p:{C)P{a^ 1 X o eD i )} = 0, i = l,...,h, 

n— >oo 

In addition to the complex form of the asymptotic variance which can hardly be evaluated, condition 
(|3.12p points at another practical problem when applying the central limit theorem to the ratio 
estimators. The next result will show that these problems will be overcome by an application of 
the stationary bootstrap. 

We construct bootstrap samples /*(£)»), . . . , I*(Di), i = 1, . . . , h+1, from the samples I\{Di), . . . , 
I n (Di), i = 1, . . . ,h + l, by a simultaneous application of the stationary bootstrap procedure, i.e., 
we use the same sequences {K{) and {Li) for the construction of the h + 1 bootstrap samples; see 
Section I3TT1 From the bootstrap samples the bootstrap versions P^{Di) of P m {Di), i = 1, . . . , h + 1, 
and jo^ D . of pc,D t , i = 1, ■ ■ ■ ,h, are constructed. 

The following result shows that the bootstrapped ratio estimators 'Pc D are asymptotically cor- 
rect estimators of their sample counterparts pc,£)» 5 i = 1,. ■ ■ ,h. 

Theorem 3.2. Assume that the following conditions hold for the strictly stationary regularly vary- 
ing sequence {Xi) of M. d -valued random vectors: 

(1) The mixing conditions (M) and (j3.6|) hold. 

(2) The growth conditions (|3.7p on p = p n hold. 

(3) The sets D\, . . . ,Dh+i, Dj x R " x Di C M are continuous with respect to p and 
Pi+i, i = 1, . . . , h + 1, p{C) > and a 2 {Di) > 0, i = 1, . . . , h. 

(4) The central limit theorem {n/m) 1 ' 2 {pc,Di — PC,Di:m) = i u ~^ N{0,H) holds, where the 
asymptotic variance is defined in Corollary 3.3 of [7] . 

Then the bootstrapped ratio estimators satisfy the following bootstrap consistency result 

(3.13) P*{\?c,D i -Pc,D i \>S)$0, 5>0, 
and the central limit theorem holds 

(3-14) P*{{n/mfI\? c>Di - Pc,D t ) i=1 _ h ei)' <&o,s(A) , 

where A is any continuity set of the normal distribution $o,s with mean zero and covariance 
matrix E. 

3.3. Consistency for the bootstrapped sample extremogram. Recall the definition of the 
sample extremogram {pAB{i)) from f)1.2[) . This estimate can be recast as a ratio estimator by 
introducing the M ( +1 '-valued vector process 

(3.15) Y t = vec{X t ,...,X t+h ), t€Z, 

consisting of stacking h+1 consecutive values of the time series {Xt). Now the sets C and Dq, . . . , Dh 
specified in Theorem 13.21 are defined through the relations C = A x R , Dq = AdB xM and for 
Di = A X K x B x Rq % for i > 1. With this convention, Theorem 13.21 can be applied to the 
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(Y t ) and (Y t *) sequences directly. We formulate here the consistency result for the bootstrapped 
sample extremogram 

En—i j 
- ,.. (=1 J {« m I x i *eA, am 1 x; +i eB} 

Pab(i) = v^j — j , «>0. 

2U=i J {a-, 1 x*eA) 

We use the same notation as in Theorem 13.21 

Corollary 3.3. Assume that the conditions of Theorem \3.S\ are satisfied for the sequence (Yt) and 
the sets C, Dq, . . . , D^ defined above. Then, conditionally on (X t ), 

{n/mf 2 {p* AB (i) - PAB(i)) l=0X _ h 4 7V(0, £) . 

For later use, we will mention a result for the return times extremogram (p A (i)) given by the limit 
relations (|5.2p for a fixed set AcK bounded away from zero and the corresponding return times 
sample extremogram (p A (i)) defined in f)5.3[) . The bootstrapped return times sample extremogram 
(p* A (i)) is defined in the straightforward way by replacing (X t ) by the stationary bootstrap sequence 
(X£). We again use the vector process (Yj) defined in f|3. 15|) and sets C = A x IR and Di = 
A x (A c y~ l x Ax M. ', i = l,...,h, to recast (pA(i))i=i,...,h an d (/5^(»))t=i,...,/i> ft- > 1, as 
ratio estimators. Then Theorem 13.21 yields central limit theorems for the corresponding sample 
extremogram and its bootstrap version which we omit. These results show that the stationary 
bootstrap is asymptotically correct for the considered return times extremogram. 

4. Examples of the Bootstrapped Sample Extremogram 

The first application of the bootstrapped extremogram is to the 6,443 daily log-returns of the 
FTSE 100 exchange from April 4, 1984 to October 2, 2009. The sample extremogram of the FTSE 
for lags 1 to 40 corresponding to the left tail (^4 = B = (— oo, — 1)) with a m equal to the negative 
of the .04 empirical quantile is displayed as the bold lines in both panels of Figure 14.11 In the left 
graph, the dashed lines represent .975 and .025 quantiles of the sampling distribution of p^ A B (h) 
based on 10,000 bootstrap replications for the daily log-returns of the FTSE. The dotted line is the 
mean of the bootstrapped replicates. The bootstrapped extremogram decays slowly to zero which 
signifies extreme serial dependence. 

The right graph in Figure l4~T1 shows approximate 95% confidence intervals (dashed lines) for the 
PA-extremogram that are found using the appropriate cutoff values from the empirical distribution 
of the bootstrapped replicates of p A B (h) — pA,B(h) and the sample extremogram (dark solid line). 
Notice that due to the bias in the bootstrapped distribution, the sample extremogram does not 
fall in the center of the intervals. Using a small p n in the bootstrapped replicates helps reduce 
this bias. Observe that the horizontal solid line at height .04, corresponding to a PA-extremogram 
under an independence assumption, is well outside these confidence bands confirming the serial 
extremal dependence. 

The next example illustrates how the stationary bootstrap for the sample extremogram performs 
as a function on the choice of the mean block size. Recall from Section [3] that the condition p n — > 
is needed in order to achieve consistency of the bootstrap estimators of the extremogram. Figure I4T21 
(top left) shows the sample extremogram of the left tail for the 5-minute log-returns of Goldman 
Sachs (GS) from December 1, 2004 to July 26, 2006. We choose sets A = B = (-oo, -1) and the 
negative of the .01 empirical quantile of the log-returns for a m . The remaining graphs in Figure I4T21 
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Figure 4.1. Left: The .975 and .025 empirical quantiles (dashed lines) of 10,000 bootstrapped replicates of 
the extremogram for the daily log-returns of the FTSE; the sample extremogram (solid line) and the mean of 
the bootstrapped replicates (dotted line). Right: 95% confidence bands (dashed lines) for the PA- extremogram 
based on a bootstrap approximation to the sampling distribution to the sample extremogram (solid line) and 
the PA- extremogram (horizontal line at .04) based on the data being independent. 



show the boxplots of the sampling distribution for bootstrap replicates of the sample extremogram 
at just the most interesting lags of 1, 79, and 158. 

The sample extremogram has a large spike at lags 79 and 158. The New York Stock Exchange 
(NYSE) is open daily from 9:30am to 4pm. Hence, there are 78 5-minute spells each day and so we 
can conclude that there is evidence of strong extremal dependence between returns a day apart. In 
the top right graph {p n = 1/50) the .975 quantiles of the distribution of the bootstrap at lags 79 
and 158 do not reach the same height as in the bottom right graph {p n = 1/200). By resampling 
blocks with mean block size 50, the assumption is made that the dependence in observations Xf 
and X t+ k for k > 50 has little, if any, impact on the distribution of the sample extremogram. In 
particular, the dependence structure is broken for lags greater than 50 and so the bootstrapped 
sample extremogram cannot capture the extent of the extremal dependence at lag 79 and beyond, 
and certainly not at lag 158. 

The fixed block bootstrap is a method that resamples blocks of data of fixed length in order to 
keep the dependence structure intact. If such a bootstrap with fixed block size 50 was used, the 
opportunity to detect the extremal dependence at lags 79 and 158 would be impossible. Thus, the 
randomly chosen block size is a potential advantage of the stationary bootstrap: it is not unlikely 
to get blocks of size 90, say, if the mean block size is 50. As the mean block sizes increase from 50 
to 100 to 200, the stationary bootstrap captures more and more of the dependence at lags 79 and 
158. 
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Figure 4.2. The sample extremogram for the 5-minute log-returns of GS (top left). Boxplots of the cor- 
responding bootstrapped replicates of the extremograms at lags 1, 79 and 158, using a mean block size of 50 
(top right), 100 (bottom left) and 200 (bottom right). 



4.1. A random permutation procedure. We use a permutation test procedure to produce 
confidence bands for the sample extremogram under the assumption that the underlying data are 
in fact independent. These bounds can be viewed as the analogue of the standard ±1.96/ ' \fn bounds 
used for the sample autocorrelation function (ACF) of a time series. Values of the sample ACF that 
extend beyond these bounds lend support that the ACF at the corresponding lags are non-zero. 
The bounds for the sample ACF are based on well-known asymptotic theory. Unfortunately, such 
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bounds are not easily computable for the sample extremogram PA,B{h)- For a fixed lag h, if the 
value of the sample extremogram for the original data is not extreme relative to the values of the 
sample extremogram based on random permutations of the data, then the sample extremogram 
is impervious to the time order of the data. On the other hand, if the f>A,B{h) is more extreme 
(either larger or smaller than all the extremograms computed for 99 random permutations of the 
data), then we conclude the presence of extremal dependence at lag h with probability .98=98/100. 
Aside from boundary effects, (i.e., the numerator of the sample extremogram is a sum over n — h 
terms and hence depends mildly on h), the permutation distribution of the sample extremogram 
is virtually the same for all lags h. The bold lines in the graphs of Figure 11.11 correspond to the 
maximum and minimum of the sample extremogram at lag 1 based on 99 random permutations of 
both the GARCH (left panel) and the SV (right panel) models. The dashed line is the value of 
the PA-extremogram under the assumption that the data are in fact independent. In this case the 
value is .02. Clearly, the extremogram is measuring extremal dependence in both series. 

4.2. Equity indices. For the next example we consider daily equity index log-returns for four 
countries: the United States, the United Kingdom, Germany and Japan. Here and in what follows, 
the indices are left in their local currencies. The top left and right graphs in Figure l4~3l show the 
sample extremograms for the negative tails {A = B = (— oo, —1] with a m estimated as the absolute 
value of the .04 empirical quantile) applied to 6,443 daily log-returns of the FTSE 100 and S&P 500 
Indices from April 4, 1984 to October 2, 2009, respectively. The bottom left and right graphs in 
Figure H31 show the analogous sample extremograms applied to 4,848 daily log-returns of the DAX 
Index from November 13, 1990 to October 2, 2009 and to 6,333 daily log-returns of the Nikkei 225 
Index from August 23, 1984 to October 2, 2009, respectivelyo The daily log-returns were calculated 
from the daily closing prices. Notice that the extremograms for all four indices decay rather slowly 
to zero, with S&P the slowest. Again the solid horizontal lines are 98% permutation produced 
bounds. Interestingly, the first lag in the sample extremogram for three of the indices (FTSE, DAX 
and Nikkei) is smaller than the second lag. Among the four indices, the Nikkei displays the least 
amount of extremal dependence as measured by the extremogram. 

The top graphs in Figure 14.31 indicate extremal dependence in the lower tail over a period of 
40 days. Typically, financial returns are modeled via a multiplication model, given by Xt = crtZti 
where (Zt) is an iid sequence of mean variance 1 random variables and for each t, the volatility 
at is independent of Z t . It is often assumed that (Zt) is heavy-tailed as well. Most financial time 
series models such as GARCH and SV, have this form. With such a model, an extreme value 
of the process occurs at time t if a% is large or if there is a large shock in the noise (i.e., Zt) at 
time t. After estimating the volatility process (ot), the estimated devolatilized process is defined 
by Zt = Xtjot- If the multiplicative model is a reasonable approximation to the return series, 
then the devolatilized series should be free of extremal dependence. For each of the four indices, 
FTSE, S&P, DAX, and Nikkei, (denoted by X tl ,X t2 ,Xt 3 and X u , respectively) a GARCH(1,1) 
was used to devolatilize each of the four component series. Let Zt% = Xti/au,i = 1, ... ,4, be the 
respective devolatilized series. Figure 14.41 shows the sample extremograms and bounds produced 
by the permutation procedure of the left tail for the filtered series Zt\ and Zfi corresponding to 



As noted in the literature, the lower tails of returns tend to be heavier than the upper tails. Similar plots (not 
shown here) of the extremogram for the upper tails also reveal extremal dependence, but to a lesser extent than seen 
in the lower tails. 
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Figure 4.3. The sample extremogram for the lower tails of the FTSE (top left), S&P (top right), DAX 
(bottom left) and Nikkei. The bold lines represent sample extremograms based on 99 random permutations 
of the data. 

FTSE and S&P. Here we used A = B = (—00, —1) and the negative of the .04 empirical quantile for 
a m . These plots confirm that the extremal dependence as measured by the extremogram has been 
removed. Results for the filtered DAX and Nikkei are very similar. Thus the extremal dependence 
in the series (X t i) is due solely to the persistence in the volatility series. 



5. Further Extensions. 
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Figure 4.4. 27ie sample extremograms for the filtered FTSE (left) and the filtered S&P (right). The bold 
lines represent sample extremograms based on 99 random permutations of their respective filtered series. 



5.1. Cross-extremogram for bivariate time series. While the definition of the extremogram 
in (ll.ip covers the case of multivariate time series, it is implicit that the index of regular variation 
is the same across the component series. For example, consider two regularly varying time series 
(X t ) and (Yt) with tail indices a\ and 02 with ot\ < 02- Then the bivariate time series ((X t , lt)')teZ 
would be regularly varying with index a± and 

lim P(x~ X Y t+h € B \ x' X X t 6 A) = 0. 

x— >oo 

In this case, the extremogram involving the Y t series would be zero and there would be no extremal 
dependence between the two component series. To avoid these rather uninteresting cases and obtain 
a more meaningful measure of extremal dependence, we transform the two series so that they have 
the same marginals. In extreme value theory, the transformation to the unit Frechet distribution 
is the most standard. For the sake of argument in this discussion, assume that both Xt and Yj are 
positive so that the focus of attention will be on extremal dependence in the upper tails. The case 
of extremal dependence in the lower tails or upper and lower tails is similar. Under the positivity 
constraint, if F\ and F2 denote the marginal distributions of Xt and Yj, respectively, then the two 
transformed series, X t = Gi(X t ) and Y t = G2(Y t ), have unit Frechet marginals (F(x) = exp{ — 1/x}, 
x > 0), where Gi{z) = — l/log(Fj(z)). Now assuming that the bivariate time series ((X t , Yt)')tez 
is regularly varying, we define the cross-extremogram by 

p A B (h) = lim Pix^Yh e B I x^Xq eA), h>0, 

' x— )-oo 

where A and B are sets bounded away from 0. 

At first glance, this may seem unpleasant since transformations to unit Frechets are required. 
However if one restricts attention to sets A and B that are intervals bounded away from or finite 
unions of such sets, then since n is the (1 — l/n)-quantile of a Frechet distribution, we have by 
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the monotonicity of the transformation Gj, {n _1 A/j G ^4} = {o~ v-X^ ^ ^} and { n_1 ^/i 6 ^} = 
{a~yY/j G -B}, where a Uj x and a n) y are the respective (1 — l/n)-quantiles of the distributions of 
Xt and Y%. So as long as the sets A and B have this form, the cross-extremogram becomes 

(5.1) p A>B (h) = lim P(a~ Y Y h G B \ a^X G A) . 

n— >oo ' > 

The point of this observation is that we do not need to actually find the transformations converting 
the data to unit Frechets, only the component-wise quantiles, a n ,x and a n ,Y> need to be calculated. 
Clearly, this notion of extremogram extends to more than two time series. 

5.2. Sample cross-extremogram for bivariate time series. As argued in Section 15. l\ the 
cross-extremogram for the bivariate time series ((Xt,Yt))t£Z is 

(5.2) PA,b(H) = lim p A ,B:m(h) = lim P(a^ Y Y h G B | a^ x X G A) , h > 0, 

where a m) y and a mi x are the (1 — m/n)-quantiles of the distributions of \Y t \ and \X t \, respectively, 
and A and B are finite unions of intervals that are bounded away from 0. (In cases where we 
only explore the upper or lower tails, we then choose a mj XiO>m,Y to be either the m/n or the 
(1 — m/n)-quantiles of the respective distribution functions.) For the sake of simplicity, we use 
the same symbol Pa,b(^) as before, abusing notation and refer to PA,B-.m as the pre- asymptotic 
cross- extremogram. 

Starting from (|5.2p . we define the sample cross-extremogram for the time series {(Xt,Yt))t£_z by 



(5.3) p AtB (h) 



En—h j 
t=1 H^yYt+heB.a^XteA} 



E"=i I 



i^.x x ^M 



where a m ^x and a m y are replaced by the respective empirical quantiles computed from {X t )t=i,...,n 
and (Y t )t=i,..., n , respectively. 

We are interested in the extremal serial dependence in the left tail for the pairs of equity index 
log-returns discussed in Section 14.21 Therefore we calculate the sample cross-extremograms for 
these pairs with sets A = B = (— oo, —1) and the negative .04 empirical quantiles of the returns for 
a m: x and a m y, respectively. 

In calculating the cross-extremograms between the daily indices (Xu) and {X t j), we only consider 
days t for which we have observations on both indices. Since FTSE, S&P, DAX, and Nikkei are 
indices from four different countries, there is not a perfect overlap on when the corresponding 
markets are open. The cross-extremograms between (Xu) and (Xtj) exhibited a similar pattern 
of slow decay as seen in the univariate sample extremograms. It was thought that the strong 
cross-extremal dependence could be an artifact due to the persistence and synchronicity in the 
marginal volatilities. This phenomenon is similar in spirit to the well-known issue for the cross- 
correlation function of linear bivariate time series. In this case, unless one or all of the component 
time series have been whitened, the cross-correlation may appear to be significant (see Chapter 
11 in Brockwell and Davis [3]). To explore this phenomenon, we computed the cross-extremogram 
for the devolatilized components. Figure 15.11 shows the sample cross-extremogram between the 
estimated residuals, Zu and Ztj for i ^ j corresponding to the four indices. For example, in the 
first row of graphs, (X t ) is the filtered FTSE (Z tl ) and (Y t ) are the filtered S&P (Z t2 ), DAX (Z t3 ) 
and Nikkei (Zu), respectively. Each graph contains permutation generated confidence bands. 
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Interestingly, there are signs of various types of cross-extremal dependence in the filtered series. 
The spike at lag zero (except between the Nikkei and S&P) indicates the extremal dependence in 
the shocks Zu and Z t j for i,j = 1,2,3. This is not surprising since we would expect dependence 
(extremal or otherwise) between the devolatized series obtained from univariate G ARCH (1,1) fits to 
each of the marginal series. In the second row, there is evidence of significant extremal dependence 
at lag one for each sample cross-extremogram: given the S&P has an extreme left tail shock at time 
t there will be a corresponding large left tail shock in the FTSE, the DAX and the Nikkei at time 
t + 1. Given the dominance of the US stock market, one might expect a carry-over effect of the 
shocks on the other exchanges on the next day. Since only marginal GARCH models were fitted to 
the data, it may not seem all that surprising that the filtered series exhibit serial dependence. We 
should note, however, that the dependence in the shocks does not appear to last beyond one time 
lag. 

We also computed the extremogram between (X t \) and (Zu) for i = 2,3,4. These plots (not 
shown) were virtually identical to those displayed in Figure 15.11 which suggests that only one of the 
components in the cross-extremogram needs to be devolatized. 

While this analysis was carried out on the left tail, the right tail (not included) shows very 
similar patterns. However, the degree of dependence is different: the left tail extremal dependence 
probability is greater than in the right tail. 

5.3. Cross-extremogram for trivariate time series. For a stationary trivariate regularly vary- 
ing time series ((Xt, Yt, Zt))tez, many different variations of the cross-extremogram can be defined 
depending on the context. We focus on the extremograms 

(5.4) Pl (h) = lim P({Y h > x} U {Z h > x} | X > x) , h>0, 

x— >oo 

(5.5) p 2 (h) = lim P(Z h > x | {Y > x} U {X > x}) , h>0. 

x—>oo 

Similar to the discussion in Section 15.11 one needs to replace the thresholds x by the quantiles of 
the marginal distributions if these are not identical. The sample cross-extremograms corresponding 
to (|5.4p and (|5.5p . respectively, are then defined as 

- , t=l * {X t >a mA and (Y t+h >a m , 2 OT Z t+h >a m ,3)} , \ n 

PlW = v=^n 7 > K>U, 

l^t=l 1 {X t >a mtl } 

En—h t 
t=i 1 {(x t >a m . 1 or Y t >a m , 2 ) and Z t+h >a m . 3 )} , 
P2{n) = F=^n — f , n>0, 

Z_jt=\ 1 {X t >a m ,i or Y t >a m:2 } 

am,ii i = 1)2,3, are chosen as the corresponding empirical quantiles of the X t 's, Yj's and Zf's, 
respectively. 

Figure 15.21 shows the sample cross-extremograms corresponding to (|5.4|) and (|5.5|) . In both 
graphs, (Xt), (Yt) and (Zt) represent the 5-minute log-returns of Bank of America (BAC), Citibank 
(CBK) and Microsoft (MSFT) from December 1, 2004 to July 26, 2006, respectively. Here, the 
absolute values of the .96 empirical quantiles of the negative returns for the series (Xt), (Yt) and 
(Z t ) are used for a m> i, a m ^ and a m> 3, respectively. In the left graph of Figure |5T2| one can interpret 
the spike at lag zero as the probability of obtaining an extreme return in either CBK or MSFT now 
given that there is an extreme return in BAC now. The spike at lag one has the interpretation as 
the probability of obtaining an extreme return in either CBK or MSFT now given that there was 
an extreme return in BAC 5 minutes ago. The interpretation of the spikes as lags and 1 in the 
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Figure 5.1. The sample cross-extremograms for the filtered FTSE, S&P, DAX and Nikkei series. For the 
first row, (X t ) is the filtered FTSE and (Y t ) are the filtered S&P, DAX and Nikkei (from left to right). For 
the second, third and fourth rows, the X t 's are the filtered S&P, DAX and Nikkei series, respectively. 



right graph is analogous. In both plots the decay is fast. If one wanted to utilize the information 
about the extremal serial dependence between the series one would have to act fast. 



5.4. The extremogram of return times between rare events. In their presentation 
Geman and Chang consider the waiting or return times between rare (or extreme) events for 
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Figure 5.2. The sample cross-extremograms for (|5.4[) (left) and (|5.5p (right) for the 5-minute log-returns 
X t ,Y t ,Z t of BAC, CBK and MSFT, respectively. 



financial time series. They define a rare event by a large excursion relative to observed returns: a 
return Xt is rare (or extreme) if Xt < £ p or Xt > £i— p , where £ q is the g-quantile of the distribution 
of returns. Typical choices for p are 0.1 and 0.05. Denoting occurrences of rare events by the binary 
sequence 



(5-1) 



W* 



1, if Xj is extreme, 
0, otherwise, 



Geman and Chang study return times Tj, j = 1, 2, . . ., between successive l's of the Wj sequence. 
If the return times were truly iid, the successive waiting times between l's should be iid geometric. 
Using the histogram of waiting times, the geometric assumption can be examined. In order to 
perform inference and in particular hypothesis testing on the extremal clustering of returns, Geman 
and Chang [13] calculate the observed entropy of the excursion waiting times and compare it to 
the entropies of random permutations of the excursion waiting times. If the observed entropy 
behaves similarly to the random permutations then one may conclude that the returns do not 
exhibit extremal clustering. 

We now introduce an analog of the extremogram for the return times between rare events in a 
strictly stationary regularly varying IR^-valued sequence (Xt). Denoting the rare event by A C R , 
the corresponding return times extremogram is given by 



(5.2) 



p A (h) = lim P(Xi xA, ... , X h ^ x xA, X h G xA \ X G xA) , h > 1 . 



Using the regular variation of the sequence (Xt) (see Section [2]) and assuming that A and A x 
Rq x A are continuity sets with respect to ji and /x/i+i, h > 1, and A is bounded away from 
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zero, we can calculate the return times extremogram 

fi w (ixl»x^) 
p A (h) = ^ , h>0. 

The return times sample extremogram is then defined as 

En—h t 
(5.3) /^(/i) = =R — , ft = 1,2, ...,n-l. 

2U=1 i {a„ 1 X t e J 4} 

An asymptotic theory for the return times sample extremogram and its bootstrap version is given 
at the end of Section T3.31 This theory shows that the stationary bootstrap is asymptotically correct 
for this sample extremogram. 

We will now illustrate some examples of the return times of extreme events. The graphs in Figure 
15.31 show the histograms for the return times of extreme events for the daily log-returns of Bank of 
America (BAC) for different choices of the rare events A: in the left graphs A = K\[^o.05;Co.95] an d 
in the right graphs A = (— oo,£o.i)- The top row contains the histogram (solid vertical lines) of the 
log-returns, whereas the bottom row shows the corresponding histograms for the filtered time series 
after fitting a GARCH(1,1) model to the data; cf. Section [4.21 Since the heights of the histogram 
correspond exactly to the return times sample extremogram, we can apply the bootstrap procedures 
of Section HI The dashed lines that overlay the graphs in Figure 15.31 represent the .975 (upper) 
and .025 (lower) confidence bands computed from the bootstrap approximation to the sampling 
distribution of the sample extremogram. The solid curve is the geometric probability mass function 
with success probability p = 0.1. As seen in these plots, the geometric probability mass function 
falls outside the confidence bands at nearly every value. This complements earlier findings of the 
presence of serial extremal dependence in the original daily returns. 

6. Appendix: Proofs 

6.1. Proof of Theorem [370 Notice that E*(P* t ) = P m . Now it follows from Theorem 3.1 in 
Davis and Mikosch [7] that 

(6.1) P m 4 fi(C) . 



This proves (|3.8|) . 

Next we prove (I3.9h . i.e., we study the asymptotic behavior of ms 2 n . Write It = It — Po and 



C n (h) = n 1 '^2l i I i+hl h = 0,...,n, 



i=l 

n—h 



j n (h) = n 1 ^2 hk+h , h = 0, . . . , n - 1 , 

i=i 

where the I/s in the summands are again defined circularly, i.e., Ij = Ij mo &n- Since 

n 

C n {h) = n- 1 Y, h h+h ~ (In? = C n {h) +p 2 - (I n f 



i=l 



and from the central limit theorem in (4) of Theorem 13.11 and (j6.ip , 

(n/m) l ^m((l n ) 2 -p 2 )= [{n/m) 1 ' 2 (P m - mp )] [m" 1 (P m + m p )] =0 P { m - 1 ) 
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Figure 5.3. The histograms (solid vertical lines) for the return times of extreme events for the daily log- 
returns of BAC using bootstrapped confidence intervals (dashed lines), geometric probability mass function 
(light solid) for A = K\[£ .05, Co. 95] (left) and A = (-00, £0.1) (right). 



we have 



n-l 



ms„ — m 



C n (0) + 2 5^(1 - h/n) (1 - p) h C n {h) 



h=\ 



n-l 



m((/„) 2 -p§) l + 2£(l-fc/n)(l-p) h 



h=\ 



3.2) 



P ((mn)- L/2 p- L ) = o P (l) 
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In the last step we used assumption (|3.7p . It follows from (|3,2p and Lemma 5.2 in Davis and 
Mikosch [7] that 



3.3) 



We also have by assumption (J3.5 



m C„(0) 4 u(C) and m C n {h) 4 T h (C) , h>\. 



.4) lim lim sup E 1 

fe— >°o n—too 



m 



J2(l-h/n)(l-p) h C n (h) 



h=k 



< lim lim sup m > poh = . 

fc->oo n-5-oo ~£ 



Combining ()6.2|) f)6.4|) and recalling the definition of o" 2 (C) from (J3.3J) . it suffices for (J3.9P to show 
that for every 5 > 0, 



lim lim sup P ( m 



n-1 



£(l-fc/n)(l-p) ft C n (/0 



h=r„ 



><H =0. 



We have 



where 



Then 



n— 1 n—r„ 

m Y J (l-h/n)(l-p) h C n (h)= Y, b n{h)ln{h) + op{l)=T n + o p {\), 

h=r n h=r n 



b n (h) = m\(l- h/n) (1 - p) h + (/t/n) (1 - p) 



n—h 



n-r n n—r n 



var(T n ) = ^ ^ kn(MM^2)cov(7 n (/ii),7 n (/j 2 )) 

h 1 =r n h,2=r„ 



n—1 n—1 



< y~] yZ K{hi)b n (h 2 ) max |cov(7 n (>i),7 n (/i 2 ))| 

, — ' , — r n <hi,h 2 <n-l 



3.5) 



hi=r n h,2=r n 
-2_2 



< cp m max |cov(7 n (/ii),7 n (/i 2 ))| 

r n <hi,h,2<n— 1 



Here and in what follows, c denotes any positive constants whose value is not of interest. We have 
for fixed k, 

n—h\ n—h,2 

Yl Yl cov(i t i t+hl ,i s i s+h2 ) 



t=l s=l 



m 2 |cov(7 n Oi),7 n (/i 2 ))| = (m/n) 2 

< cm 2 /n^|cov(/o4 1 ,^r4+fe 2 ) 



r=l 



= c(m/n)ml Y + Y + Y l cov ( J o4i, Irl r +h 2 ) 

\\r—hi\<k k<\r—hi\<r n \r—hi\>r n / 
= C (m/n) [Ji + J 2 + J 3 ] . 
Using the sequential definition of the regular variation of (Xt) (see ()2.ip ). we have for fixed k 
limsup J\ < limsup \, m [Po,\r-hi\ + Po\ < /, T /i(C) ^ a (C) < °° • 



|r-/ii|<fc 



h<fc 
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Next we use condition (13.51): 



lim limsupJ2 < lim limsupm > Po\r-hi\ + c nm mr nPo = 0. 

fc-KX) n ->oo " fc^oo n ->oo , . ,, ' n-J-oo 

fe<|r— fti|<r n 



Finally, the mixing condition (|3.4p yields 

lim sup J3 < c limsupm V, a |r-/n| = 0- 

n— >oo n— >oo 

r-/ii|>r n 

Thus we proved that 

m 2 |cov(7 n (/ii),7 n (/i 2 ))| < c(m/n) 

uniformly for hi,h,2 > ^n an d large n. We conclude from (|6.5p and assumption (|3.7j) that 

var(T n ) < cm/(np ) — >• . 

Using (|3.4p . it also follows that -ET n — )■ 0. Thus we proved that E(T 2 ) — > 0. Combining the bounds 
above, we conclude that (j3.9|) is satisfied. 

Next we prove the central limit theorem (|3.1ip . Since both sums P m and P m contain the 
same number of summands and we consider the difference (n/m) 1 ' 2 (P* — P m ) we will assume 
in what follows that all summands (rn/n)It in P m and P m are replaced by their centered versions 
(m/n)It = (m/n)(It — po). We write P m and P m for the corresponding centered versions. 

Write 

SKuLi = lKi~\ \-lKi+Li-l, i = l,2,..., 

(6.6) S n N = Sk 1 ,l 1 ~\ + Sk n ,l n ■ 



Lemma 6.1. Under the conditions of Theorem \3.1[ 

(6.7) P*{(n/m) 1/2 \P m - (m/n)S nN \ > 5) -»• 0, 5 > 0. 



Proof. By the argument in Politis and Romano |18j on p. 1312, using the memoryless property of 
the geometric distribution, {m/n)S n N — Pm has the same distribution as (rn/ri)SKx,Li with respect 
to P* . Hence it suffices for (|6.7|) to show that (m/n)E*(\SK 1 ,L 1 \ 2 ) — > 0. An application of Markov's 
inequality shows that the latter condition is satisfied if 

(6.8) (m/n)E(\S KlM \ 2 )^0. 

We have by stationarity that 

(m/n)E(\S KlM \ 2 ) = (m/n)E(\h + ---+J L \ 2 ) 



= (m/n) ^2 var(Ii H h h) (1 - p) 1 1 p. 

1=1 

We have (sums over empty index sets being zero) for every fixed k,l > 1, 

l-l 
mvar(/i + • • • + 1{) = I m var(Ji) + 2m / _,(! — h/l) cov(Jq, Ih) \ 

h=i 

= I mvar(Ii) + 2m( ^ + ^ + ^ )(1 - h/l) cov (I ,I h ) 



Kh<k k<h<r n r n <h<l—l 
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The right-hand side is 0(1) uniformly for I by virtue of regular variation of (X t ) and in view of the 
mixing condition (M). Since np — > oo we have 

oo 

{m/n)E{\S Kl M\ 2 ) < en" 1 ^ / (1 - p)^ 1 ^ < c(np)" 1 -> Q . 



i=i 



This proves (I6.8P and hnishes the proof of the lemma. 



□ 



In view of (JS2D it suffices for (13T1J) to show that {n/m) l / 2 [{m/n)S nN - P m ] 4 N(0,a 2 (C)) 
conditional on (Xj). Recall that E*(Sk 1 ,l 1 ) = P~ X (In —po)- Then 

N 
(6.9) (n/m) 1 ' 2 \(m/n)S nN - P m ] - (m/n) 1 / 2 £(5*,^ - p-\7 n - p )) 



i=l 



Jnp L 



p 

J- 77 



The quantities ((N — np)/y/np) are asymptotically normal as np — > oo and have variances which 
are bounded for all n. Therefore these quantities are stochastically bounded in P*-probability. 
Moreover, the quantities (n/m) 1 / 2 P m have bounded variances in P-probability, and therefore 

(np)- 1 /2[( n / m )V2p m ]4 . 
These arguments applied to (|6.9p yield for 5 > 0, 

N 



P* 



(n/m) x ' 2 [(m/n)S n N - P m ] - (m/n) 1 / 2 ^(Sk^ -p~\l n ~ Po)) 



> 5 ) 4 . 



Therefore it suffices to prove that 

N 



(m/n) 1 / 2 J2(S KiM -p~\l n -p )) A N(0,a 2 (C)) 



4 = 1 



conditional on (Xj). Now, an Anscombe type argument (e.g. Embrechts et al. [12], Lemma 2.5.8) 
combined with the asymptotic normality of ((N — np)/y/np) as np — > oo show that the random 
index N in the sum above can be replaced by any integer sequence I = t n — > oo satisfying the 
relation np/£ —> 1. Given (Xj), the triangular array 

(m/n^iSK^-p^CIn-po)), i = !,...,£, n = l,2,..., 

consists of row-wise iid mean zero random variables, hence it satisfies the assumption of infinite 
smallness conditional on (Xj). Therefore it suffices to apply a classical central limit theorem for 
triangular arrays of independent random variables conditional on (Xj). In view of (|3.9p . cr 2 (C) 
is the asymptotic variance of the converging partial sum sequence. Thus it suffices to prove the 
following Lyapunov condition conditional on (Xj): 

(m/nf/ 2 £E*(\S KuLl -p~ x (l n - Po )f) ~ m 3 / 2 ^ 1 / 2 P E*{\S KlM -p~ x (l n - p )| 3 ) 4 . 

An application of the C r -inequality yields 

m z l 2 n- x l 2 pE*(\S KlM -p- x (! n -po)?) 



< 4m 3 / 2 n- 1 / 2 



P 



E*l\S KlM \ a )+ p - a \I n -po 



4m 3 / 2 n- 1/2 pE*(\S KuLl \ 3 ) + A(np)- 2 \(n/m) x/2 P n 
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Since the last expression is op(l) it suffices to show that 

|3\ p , 



(6.10) 



m 3 / 2 n^l 2 pE*(\S KlM \ 3 )^Q. 



An application of Markov's inequality shows that it suffices to switch to unconditional moments in 
the last expression. Writing Si = I\ H + ij, we have by stationarity of (Xi) 

oo 

E(\S KlM \ 3 ) = E(\S Ll \ 3 ) =Y,E(\S l \ 3 ) (l-pY-'p. 

1=1 

Next we employ a moment bound due to Rio [21], p. 54, 

E(\Si\ 3 ) <3sf + 144/ / [a~ 1 (x/2) /\ I] 2 Q 3 (x) dx , 
Jo 

where a -1 denotes the generalized inverse of the rate function a(t) = aw, Q is the quantile function 
of the distribution of |ii| = \I\ — po\ and 



i i i-i 

*? = YY, l cov ^' J j)l = ^o (1 - Po) + 2 J3(/ - /i) |p h 



•rfl 



For every fixed k > 1, 

5? < (i/m) 



3 7"ri « J- 

'"/'o + 2[^+ ^ + Y, ]0--h/l)m\p oh 

h=l h=k+l h=r„+l. 



n 2 \ 

Pol 



where sums over empty index sets are zero. Using regular variation of (Xi) and the mixing condition 
(M), we conclude that the right-hand side is of the order 0(1 /m) uniformly for I. Hence 



m 3/2 n l/2 p 



E-fCl-jO'- 1 * < cn^ 2 pYl" 2 (l-p) 1 - 1 



P 



i=i 



i=i 



< c(np)~ 1/2 ^0. 

Direct calculation with the quantile function of \It\ shows that 

1 rl-po /-i 

(a~ 1 (x/2) M) 2 Q 3 (x)dx = p 3 \ [a~ x (x/2) A l] 2 dx + (1 - p ) 3 / [a" 1 (a;/2) A I} 2 dx 
n Jo Jl-po 

< c [m~ 3 Y ka k + m" 1 ] = 0(m~ l ) . 
k=i 



In the last step we used condition (I3.6p . Combining the estimates above, we obtain 



m ' n 



l l 2 pY E (\Sl?)(l-p) l ~ X P < c[(np)- 1 / 2 + (m/n)- 1 / 2 }^0. 



l=i 



This proves relation (|6.10p and concludes the proof of the theorem. 
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6.2. Proof of Theorem 13.21 From (|3.10p we know that 
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.11) 



P*(|P^(A)-/i(A)|>«5)4 0, 5>0, 



l,...,h + l, 



therefore (13.13P follows. 

Relation (I6.1ip implies that for each % = 1, . . . , h, in P*-probability, 



Pc,D t ~ PC,D, 



P^(Di)P m (C) - P* m {C)P m {D % 



Pm\C)Pm\C) 



1 + Op(l) 



M (C)(P^(A) - P«(A)) - M(A)(i^(C) - P m {C)) 



Therefore it suffices for the central limit theorem (|3.14p to prove a multivariate central limit theorem 
for the quantities P^( A) — Pm{ A), « = 1, . . . , /i+ 1. We will show the result for /i = 1; the general 
case is analogous. It will be convenient to write D = A an d C = A- 



iV(0,E), 



Lemma 6.2. T/ie following central limit theorem holds in P* -probability 

V P* m {C) - P m (C) J 
where the asymptotic covariance matrix is given by 
( o 2 (D) T DC \ 

' -HO J ' 



TDC 



r D c = fi(CnD) + J2[^+i(P ) 



Rf 2) xC) + /i W (Cxlf 2) xD)] 



j=l 



Proof. We show the result by using the Cramer- Wold device, i.e. 

z'S n 4 iV(0, z'Ez) , zeR 2 . 

We indicate the main steps in the proof in which we follow the lines of the proof of Theorem 13.11 
We observe that E*(z'S n ) = 0. Next we show that, conditional on (Xt) 

var*(z'S„) = (n/m) [z?W(P^(D)) + ^var*(P^(C)) + 2 Zl z 2 cov* (P^(C) , P^(D)) 



(6.12) 



z'Sz. 



p , Jii 



K ji, 



By d3HD, (n/m)var*(P^(L>)) 4 a 2 (D) and (n/m)var*(P^(C)) 4 ct 2 (C). Hence it suffices to show 
that 



(6.13) 

We observe that 



(n/m)cov*(P^(C),P^(D)) 4 r DC ■ 



cov* (P^(C), P* m {D)) = - [™*{P* m {C) + P* m (D)) - var*(P^(C) - i* (£>)) 

Observe that P„(C) ± P^(I>) contain the bootstrap sequences I t *(C) ± I t *(A) = (I t (C) ± I*(A))*, 
£ = 1, . . . , n. Therefore the same ideas as for Lemma 5.2 in Davis and Mikosch [7\ and in the proof 
of Theorem 13.11 above apply to show (|6.13p . We omit the details. 
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It immediately follows from Lemma 16.11 and the argument following it that the multivariate 
central limit theorem can be reduced to the central limit theorem for the triangular array 

(m/n) l / 2 [z 1 (S Ki , Li (D)-p-\l4D)-p (D)))+z 2 (S KiM (C)-p- 1 (l n (C)-p (C))) 
i = l,...,£, n = 1,2,... , 
(6.14) 

where £ = £ n satisfies the relation np/£ — > 1. This array consists of row- wise iid mean zero random 
variables, conditional on (X t ). Relation (|6.12p yields the correct asymptotic variance for the central 
limit theorem of the quantities (|6.14j) . Therefore it again suffices to apply a Lyapunov condition of 
order 3 to the summands (|6.14p conditional on (Xt). However, an application of the C r -inequality 
yields that, up to a constant multiple, this Lyapunov ratio is bounded by the sum of the Lyapunov 
ratios of Skx,Li(C) and Sk 1 ,l 1 (D) which, conditional on (X t ), were shown to converge to zero in 
the proof of Theorem 13.11 This finishes the sketch of the proof of the theorem. □ 
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